Dataset statistics
| Number of variables | 29 |
|---|---|
| Number of observations | 2085494 |
| Missing cells | 17849110 |
| Missing cells (%) | 29.5% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 2.0 GiB |
| Average record size in memory | 1.0 KiB |
Variable types
| DateTime | 2 |
|---|---|
| Categorical | 6 |
| Unsupported | 1 |
| Numeric | 8 |
| Text | 12 |
NUMBER OF PEDESTRIANS KILLED is highly imbalanced (99.6%) | Imbalance |
NUMBER OF CYCLIST INJURED is highly imbalanced (92.3%) | Imbalance |
NUMBER OF CYCLIST KILLED is highly imbalanced (99.9%) | Imbalance |
CONTRIBUTING FACTOR VEHICLE 4 is highly imbalanced (90.8%) | Imbalance |
CONTRIBUTING FACTOR VEHICLE 5 is highly imbalanced (89.9%) | Imbalance |
BOROUGH has 648930 (31.1%) missing values | Missing |
ZIP CODE has 649184 (31.1%) missing values | Missing |
LATITUDE has 234300 (11.2%) missing values | Missing |
LONGITUDE has 234300 (11.2%) missing values | Missing |
LOCATION has 234300 (11.2%) missing values | Missing |
ON STREET NAME has 443414 (21.3%) missing values | Missing |
CROSS STREET NAME has 789605 (37.9%) missing values | Missing |
OFF STREET NAME has 1734453 (83.2%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 2 has 324030 (15.5%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 3 has 1936332 (92.8%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 4 has 2051785 (98.4%) missing values | Missing |
CONTRIBUTING FACTOR VEHICLE 5 has 2076357 (99.6%) missing values | Missing |
VEHICLE TYPE CODE 2 has 399976 (19.2%) missing values | Missing |
VEHICLE TYPE CODE 3 has 1941769 (93.1%) missing values | Missing |
VEHICLE TYPE CODE 4 has 2052958 (98.4%) missing values | Missing |
VEHICLE TYPE CODE 5 has 2076637 (99.6%) missing values | Missing |
LATITUDE is highly skewed (γ1 = -20.39832654) | Skewed |
NUMBER OF PERSONS KILLED is highly skewed (γ1 = 33.61661618) | Skewed |
NUMBER OF MOTORIST KILLED is highly skewed (γ1 = 54.56006989) | Skewed |
COLLISION_ID has unique values | Unique |
ZIP CODE is an unsupported type, check if it needs cleaning or further analysis | Unsupported |
NUMBER OF PERSONS INJURED has 1607054 (77.1%) zeros | Zeros |
NUMBER OF PERSONS KILLED has 2082457 (99.9%) zeros | Zeros |
NUMBER OF PEDESTRIANS INJURED has 1972076 (94.6%) zeros | Zeros |
NUMBER OF MOTORIST INJURED has 1780333 (85.4%) zeros | Zeros |
NUMBER OF MOTORIST KILLED has 2084302 (99.9%) zeros | Zeros |
Reproduction
| Analysis started | 2024-05-07 03:12:49.209694 |
|---|---|
| Analysis finished | 2024-05-07 03:13:48.986679 |
| Duration | 59.78 seconds |
| Software version | ydata-profiling vv4.7.0 |
| Download configuration | config.json |
CRASH DATE
Date
| Distinct | 4325 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.9 MiB |
| Minimum | 2012-07-01 00:00:00 |
|---|---|
| Maximum | 2024-05-03 00:00:00 |
CRASH TIME
Date
| Distinct | 1440 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 15.9 MiB |
| Minimum | 2024-05-06 00:00:00 |
|---|---|
| Maximum | 2024-05-06 23:59:00 |
BOROUGH
Categorical
MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 648930 |
| Missing (%) | 31.1% |
| Memory size | 127.9 MiB |
| BROOKLYN | |
|---|---|
| QUEENS | |
| MANHATTAN | |
| BRONX | |
| STATEN ISLAND |
Length
| Max length | 13 |
|---|---|
| Median length | 9 |
| Mean length | 7.4536603 |
| Min length | 5 |
Characters and Unicode
| Total characters | 10707660 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | BROOKLYN |
|---|---|
| 2nd row | BROOKLYN |
| 3rd row | BRONX |
| 4th row | BROOKLYN |
| 5th row | MANHATTAN |
Common Values
| Value | Count | Frequency (%) |
| BROOKLYN | 457165 | |
| QUEENS | 385197 | |
| MANHATTAN | 321417 | |
| BRONX | 212475 | 10.2% |
| STATEN ISLAND | 60310 | 2.9% |
| (Missing) | 648930 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| brooklyn | 457165 | |
| queens | 385197 | |
| manhattan | 321417 | |
| bronx | 212475 | |
| staten | 60310 | 4.0% |
| island | 60310 | 4.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1818291 | |
| O | 1126805 | |
| A | 1084871 | |
| E | 830704 | 7.8% |
| T | 763454 | 7.1% |
| R | 669640 | 6.3% |
| B | 669640 | 6.3% |
| L | 517475 | 4.8% |
| S | 505817 | 4.7% |
| Y | 457165 | 4.3% |
| Other values (9) | 2263798 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 10707660 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| N | 1818291 | |
| O | 1126805 | |
| A | 1084871 | |
| E | 830704 | 7.8% |
| T | 763454 | 7.1% |
| R | 669640 | 6.3% |
| B | 669640 | 6.3% |
| L | 517475 | 4.8% |
| S | 505817 | 4.7% |
| Y | 457165 | 4.3% |
| Other values (9) | 2263798 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 10707660 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| N | 1818291 | |
| O | 1126805 | |
| A | 1084871 | |
| E | 830704 | 7.8% |
| T | 763454 | 7.1% |
| R | 669640 | 6.3% |
| B | 669640 | 6.3% |
| L | 517475 | 4.8% |
| S | 505817 | 4.7% |
| Y | 457165 | 4.3% |
| Other values (9) | 2263798 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 10707660 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| N | 1818291 | |
| O | 1126805 | |
| A | 1084871 | |
| E | 830704 | 7.8% |
| T | 763454 | 7.1% |
| R | 669640 | 6.3% |
| B | 669640 | 6.3% |
| L | 517475 | 4.8% |
| S | 505817 | 4.7% |
| Y | 457165 | 4.3% |
| Other values (9) | 2263798 |
ZIP CODE
Unsupported
MISSING  REJECTED  UNSUPPORTED 
| Missing | 649184 |
|---|---|
| Missing (%) | 31.1% |
| Memory size | 75.1 MiB |
LATITUDE
Real number (ℝ)
MISSING  SKEWED 
| Distinct | 126738 |
|---|---|
| Distinct (%) | 6.8% |
| Missing | 234300 |
| Missing (%) | 11.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.627378 |
| Minimum | 0 |
|---|---|
| Maximum | 43.344444 |
| Zeros | 4396 |
| Zeros (%) | 0.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 40.596612 |
| Q1 | 40.667774 |
| median | 40.720764 |
| Q3 | 40.769612 |
| 95-th percentile | 40.86204 |
| Maximum | 43.344444 |
| Range | 43.344444 |
| Interquartile range (IQR) | 0.1018378 |
Descriptive statistics
| Standard deviation | 1.9837524 |
|---|---|
| Coefficient of variation (CV) | 0.04882797 |
| Kurtosis | 414.76605 |
| Mean | 40.627378 |
| Median Absolute Deviation (MAD) | 0.0513402 |
| Skewness | -20.398327 |
| Sum | 75209158 |
| Variance | 3.9352735 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4396 | 0.2% |
| 40.861862 | 889 | < 0.1% |
| 40.696033 | 771 | < 0.1% |
| 40.8047 | 692 | < 0.1% |
| 40.608757 | 671 | < 0.1% |
| 40.798256 | 627 | < 0.1% |
| 40.759308 | 625 | < 0.1% |
| 40.6960346 | 587 | < 0.1% |
| 40.675735 | 559 | < 0.1% |
| 40.658577 | 523 | < 0.1% |
| Other values (126728) | 1840854 | |
| (Missing) | 234300 | 11.2% |
| Value | Count | Frequency (%) |
| 0 | 4396 | |
| 30.78418 | 1 | < 0.1% |
| 34.783634 | 1 | < 0.1% |
| 40.498947 | 1 | < 0.1% |
| 40.4989488 | 2 | < 0.1% |
| 40.4991346 | 1 | < 0.1% |
| 40.49931 | 1 | < 0.1% |
| 40.4994787 | 1 | < 0.1% |
| 40.499659 | 1 | < 0.1% |
| 40.49971 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 43.344444 | 1 | < 0.1% |
| 42.64154 | 1 | < 0.1% |
| 42.318317 | 1 | < 0.1% |
| 42.107204 | 1 | < 0.1% |
| 41.91661 | 1 | < 0.1% |
| 41.34796 | 1 | < 0.1% |
| 41.258785 | 1 | < 0.1% |
| 41.12615 | 5 | |
| 41.12421 | 1 | < 0.1% |
| 41.061634 | 2 | < 0.1% |
LONGITUDE
Real number (ℝ)
MISSING 
| Distinct | 98436 |
|---|---|
| Distinct (%) | 5.3% |
| Missing | 234300 |
| Missing (%) | 11.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -73.751547 |
| Minimum | -201.35999 |
|---|---|
| Maximum | 0 |
| Zeros | 4396 |
| Zeros (%) | 0.2% |
| Negative | 1846798 |
| Negative (%) | 88.6% |
| Memory size | 15.9 MiB |
Quantile statistics
| Minimum | -201.35999 |
|---|---|
| 5-th percentile | -74.03613 |
| Q1 | -73.97483 |
| median | -73.92726 |
| Q3 | -73.866731 |
| 95-th percentile | -73.76325 |
| Maximum | 0 |
| Range | 201.35999 |
| Interquartile range (IQR) | 0.1080989 |
Descriptive statistics
| Standard deviation | 3.7281252 |
|---|---|
| Coefficient of variation (CV) | -0.05054979 |
| Kurtosis | 439.11784 |
| Mean | -73.751547 |
| Median Absolute Deviation (MAD) | 0.052606 |
| Skewness | 16.106495 |
| Sum | -1.3652842 × 108 |
| Variance | 13.898918 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 4396 | 0.2% |
| -73.89063 | 768 | < 0.1% |
| -73.91282 | 719 | < 0.1% |
| -73.98453 | 700 | < 0.1% |
| -74.038086 | 672 | < 0.1% |
| -73.89686 | 659 | < 0.1% |
| -73.91243 | 654 | < 0.1% |
| -73.94476 | 590 | < 0.1% |
| -73.9845292 | 587 | < 0.1% |
| -73.9112 | 581 | < 0.1% |
| Other values (98426) | 1840868 | |
| (Missing) | 234300 | 11.2% |
| Value | Count | Frequency (%) |
| -201.35999 | 1 | < 0.1% |
| -201.23706 | 105 | |
| -89.13527 | 1 | < 0.1% |
| -86.76847 | 1 | < 0.1% |
| -79.61955 | 1 | < 0.1% |
| -79.00183 | 1 | < 0.1% |
| -76.2634 | 1 | < 0.1% |
| -76.02163 | 1 | < 0.1% |
| -74.742 | 7 | < 0.1% |
| -74.25496 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 4396 | |
| -32.768513 | 16 | < 0.1% |
| -47.209625 | 3 | < 0.1% |
| -73.66301 | 1 | < 0.1% |
| -73.70055 | 2 | < 0.1% |
| -73.700584 | 11 | < 0.1% |
| -73.7005968 | 10 | < 0.1% |
| -73.70061 | 4 | < 0.1% |
| -73.70071 | 4 | < 0.1% |
| -73.70073 | 1 | < 0.1% |
LOCATION
Text
MISSING 
| Distinct | 284545 |
|---|---|
| Distinct (%) | 15.4% |
| Missing | 234300 |
| Missing (%) | 11.2% |
| Memory size | 148.0 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 24 |
| Mean length | 22.774418 |
| Min length | 10 |
Characters and Unicode
| Total characters | 42159866 |
|---|---|
| Distinct characters | 16 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 156445 ? |
|---|---|
| Unique (%) | 8.5% |
Sample
| 1st row | (40.667202, -73.8665) |
|---|---|
| 2nd row | (40.683304, -73.917274) |
| 3rd row | (40.709183, -73.956825) |
| 4th row | (40.86816, -73.83148) |
| 5th row | (40.67172, -73.8971) |
| Value | Count | Frequency (%) |
| 0.0 | 8792 | 0.2% |
| 40.861862 | 889 | < 0.1% |
| 40.696033 | 771 | < 0.1% |
| 73.89063 | 768 | < 0.1% |
| 73.91282 | 719 | < 0.1% |
| 73.98453 | 700 | < 0.1% |
| 40.8047 | 692 | < 0.1% |
| 74.038086 | 672 | < 0.1% |
| 40.608757 | 671 | < 0.1% |
| 73.89686 | 659 | < 0.1% |
| Other values (225163) | 3687055 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 4617330 | |
| 4 | 4000028 | 9.5% |
| . | 3702388 | 8.8% |
| 3 | 3515270 | 8.3% |
| 0 | 3417232 | 8.1% |
| 9 | 2712408 | 6.4% |
| 8 | 2661399 | 6.3% |
| 6 | 2629407 | 6.2% |
| 5 | 2104583 | 5.0% |
| ( | 1851194 | 4.4% |
| Other values (6) | 10948627 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 42159866 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 7 | 4617330 | |
| 4 | 4000028 | 9.5% |
| . | 3702388 | 8.8% |
| 3 | 3515270 | 8.3% |
| 0 | 3417232 | 8.1% |
| 9 | 2712408 | 6.4% |
| 8 | 2661399 | 6.3% |
| 6 | 2629407 | 6.2% |
| 5 | 2104583 | 5.0% |
| ( | 1851194 | 4.4% |
| Other values (6) | 10948627 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 42159866 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 7 | 4617330 | |
| 4 | 4000028 | 9.5% |
| . | 3702388 | 8.8% |
| 3 | 3515270 | 8.3% |
| 0 | 3417232 | 8.1% |
| 9 | 2712408 | 6.4% |
| 8 | 2661399 | 6.3% |
| 6 | 2629407 | 6.2% |
| 5 | 2104583 | 5.0% |
| ( | 1851194 | 4.4% |
| Other values (6) | 10948627 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 42159866 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 7 | 4617330 | |
| 4 | 4000028 | 9.5% |
| . | 3702388 | 8.8% |
| 3 | 3515270 | 8.3% |
| 0 | 3417232 | 8.1% |
| 9 | 2712408 | 6.4% |
| 8 | 2661399 | 6.3% |
| 6 | 2629407 | 6.2% |
| 5 | 2104583 | 5.0% |
| ( | 1851194 | 4.4% |
| Other values (6) | 10948627 |
ON STREET NAME
Text
MISSING 
| Distinct | 18458 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 443414 |
| Missing (%) | 21.3% |
| Memory size | 149.1 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 29.56421 |
| Min length | 2 |
Characters and Unicode
| Total characters | 48546798 |
|---|---|
| Distinct characters | 75 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6551 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | WHITESTONE EXPRESSWAY |
|---|---|
| 2nd row | QUEENSBORO BRIDGE UPPER |
| 3rd row | THROGS NECK BRIDGE |
| 4th row | SARATOGA AVENUE |
| 5th row | MAJOR DEEGAN EXPRESSWAY RAMP |
| Value | Count | Frequency (%) |
| avenue | 610782 | 16.1% |
| street | 522974 | 13.8% |
| east | 154098 | 4.1% |
| boulevard | 127527 | 3.4% |
| west | 115215 | 3.0% |
| parkway | 75148 | 2.0% |
| road | 68379 | 1.8% |
| expressway | 63767 | 1.7% |
| island | 30625 | 0.8% |
| queens | 27288 | 0.7% |
| Other values (5394) | 1993059 |
Most occurring characters
| Value | Count | Frequency (%) |
| 27572212 | ||
| E | 3689040 | 7.6% |
| A | 1960159 | 4.0% |
| T | 1839600 | 3.8% |
| R | 1677642 | 3.5% |
| N | 1434455 | 3.0% |
| S | 1414317 | 2.9% |
| U | 981985 | 2.0% |
| O | 872965 | 1.8% |
| V | 855815 | 1.8% |
| Other values (65) | 6248608 | 12.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 48546798 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 27572212 | ||
| E | 3689040 | 7.6% |
| A | 1960159 | 4.0% |
| T | 1839600 | 3.8% |
| R | 1677642 | 3.5% |
| N | 1434455 | 3.0% |
| S | 1414317 | 2.9% |
| U | 981985 | 2.0% |
| O | 872965 | 1.8% |
| V | 855815 | 1.8% |
| Other values (65) | 6248608 | 12.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 48546798 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 27572212 | ||
| E | 3689040 | 7.6% |
| A | 1960159 | 4.0% |
| T | 1839600 | 3.8% |
| R | 1677642 | 3.5% |
| N | 1434455 | 3.0% |
| S | 1414317 | 2.9% |
| U | 981985 | 2.0% |
| O | 872965 | 1.8% |
| V | 855815 | 1.8% |
| Other values (65) | 6248608 | 12.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 48546798 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 27572212 | ||
| E | 3689040 | 7.6% |
| A | 1960159 | 4.0% |
| T | 1839600 | 3.8% |
| R | 1677642 | 3.5% |
| N | 1434455 | 3.0% |
| S | 1414317 | 2.9% |
| U | 981985 | 2.0% |
| O | 872965 | 1.8% |
| V | 855815 | 1.8% |
| Other values (65) | 6248608 | 12.9% |
MISSING 
| Distinct | 20256 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 789605 |
| Missing (%) | 37.9% |
| Memory size | 122.6 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 32 |
| Mean length | 22.670409 |
| Min length | 1 |
Characters and Unicode
| Total characters | 29378334 |
|---|---|
| Distinct characters | 76 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 6202 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 20 AVENUE |
|---|---|
| 2nd row | DECATUR STREET |
| 3rd row | EAST 43 STREET |
| 4th row | EAST GATE PLAZA |
| 5th row | west 80 street -west 81 street |
| Value | Count | Frequency (%) |
| avenue | 567479 | 19.8% |
| street | 461146 | 16.1% |
| east | 112564 | 3.9% |
| west | 71342 | 2.5% |
| boulevard | 68928 | 2.4% |
| road | 55777 | 1.9% |
| place | 34080 | 1.2% |
| parkway | 26719 | 0.9% |
| 3 | 18826 | 0.7% |
| park | 17492 | 0.6% |
| Other values (5485) | 1431852 |
Most occurring characters
| Value | Count | Frequency (%) |
| 14121512 | ||
| E | 2948298 | 10.0% |
| T | 1458774 | 5.0% |
| A | 1425079 | 4.9% |
| R | 1151727 | 3.9% |
| N | 1079107 | 3.7% |
| S | 992513 | 3.4% |
| U | 780282 | 2.7% |
| V | 711591 | 2.4% |
| O | 580789 | 2.0% |
| Other values (66) | 4128662 | 14.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 29378334 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 14121512 | ||
| E | 2948298 | 10.0% |
| T | 1458774 | 5.0% |
| A | 1425079 | 4.9% |
| R | 1151727 | 3.9% |
| N | 1079107 | 3.7% |
| S | 992513 | 3.4% |
| U | 780282 | 2.7% |
| V | 711591 | 2.4% |
| O | 580789 | 2.0% |
| Other values (66) | 4128662 | 14.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 29378334 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 14121512 | ||
| E | 2948298 | 10.0% |
| T | 1458774 | 5.0% |
| A | 1425079 | 4.9% |
| R | 1151727 | 3.9% |
| N | 1079107 | 3.7% |
| S | 992513 | 3.4% |
| U | 780282 | 2.7% |
| V | 711591 | 2.4% |
| O | 580789 | 2.0% |
| Other values (66) | 4128662 | 14.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 29378334 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 14121512 | ||
| E | 2948298 | 10.0% |
| T | 1458774 | 5.0% |
| A | 1425079 | 4.9% |
| R | 1151727 | 3.9% |
| N | 1079107 | 3.7% |
| S | 992513 | 3.4% |
| U | 780282 | 2.7% |
| V | 711591 | 2.4% |
| O | 580789 | 2.0% |
| Other values (66) | 4128662 | 14.1% |
OFF STREET NAME
Text
MISSING 
| Distinct | 227639 |
|---|---|
| Distinct (%) | 64.8% |
| Missing | 1734453 |
| Missing (%) | 83.2% |
| Memory size | 84.0 MiB |
Length
| Max length | 40 |
|---|---|
| Median length | 40 |
| Mean length | 35.917465 |
| Min length | 8 |
Characters and Unicode
| Total characters | 12608503 |
|---|---|
| Distinct characters | 84 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 177508 ? |
|---|---|
| Unique (%) | 50.6% |
Sample
| 1st row | 1211 LORING AVENUE |
|---|---|
| 2nd row | 344 BAYCHESTER AVENUE |
| 3rd row | 2047 PITKIN AVENUE |
| 4th row | 480 DEAN STREET |
| 5th row | 878 FLATBUSH AVENUE |
| Value | Count | Frequency (%) |
| avenue | 139121 | 11.9% |
| street | 126988 | 10.9% |
| east | 33467 | 2.9% |
| west | 24196 | 2.1% |
| boulevard | 22279 | 1.9% |
| road | 16574 | 1.4% |
| lot | 7881 | 0.7% |
| parking | 7267 | 0.6% |
| parkway | 6996 | 0.6% |
| of | 6954 | 0.6% |
| Other values (27625) | 775853 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6887772 | ||
| E | 803446 | 6.4% |
| T | 439996 | 3.5% |
| A | 411834 | 3.3% |
| R | 342330 | 2.7% |
| N | 300915 | 2.4% |
| S | 288305 | 2.3% |
| 1 | 279274 | 2.2% |
| U | 204657 | 1.6% |
| V | 190938 | 1.5% |
| Other values (74) | 2459036 | 19.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 12608503 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 6887772 | ||
| E | 803446 | 6.4% |
| T | 439996 | 3.5% |
| A | 411834 | 3.3% |
| R | 342330 | 2.7% |
| N | 300915 | 2.4% |
| S | 288305 | 2.3% |
| 1 | 279274 | 2.2% |
| U | 204657 | 1.6% |
| V | 190938 | 1.5% |
| Other values (74) | 2459036 | 19.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 12608503 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 6887772 | ||
| E | 803446 | 6.4% |
| T | 439996 | 3.5% |
| A | 411834 | 3.3% |
| R | 342330 | 2.7% |
| N | 300915 | 2.4% |
| S | 288305 | 2.3% |
| 1 | 279274 | 2.2% |
| U | 204657 | 1.6% |
| V | 190938 | 1.5% |
| Other values (74) | 2459036 | 19.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 12608503 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 6887772 | ||
| E | 803446 | 6.4% |
| T | 439996 | 3.5% |
| A | 411834 | 3.3% |
| R | 342330 | 2.7% |
| N | 300915 | 2.4% |
| S | 288305 | 2.3% |
| 1 | 279274 | 2.2% |
| U | 204657 | 1.6% |
| V | 190938 | 1.5% |
| Other values (74) | 2459036 | 19.5% |
NUMBER OF PERSONS INJURED
Real number (ℝ)
ZEROS 
| Distinct | 32 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.31104122 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 1607054 |
| Zeros (%) | 77.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 2 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.70104436 |
|---|---|
| Coefficient of variation (CV) | 2.2538632 |
| Kurtosis | 50.982476 |
| Mean | 0.31104122 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.2497531 |
| Sum | 648669 |
| Variance | 0.49146319 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1607054 | |
| 1 | 371328 | 17.8% |
| 2 | 69944 | 3.4% |
| 3 | 22850 | 1.1% |
| 4 | 8466 | 0.4% |
| 5 | 3249 | 0.2% |
| 6 | 1361 | 0.1% |
| 7 | 579 | < 0.1% |
| 8 | 255 | < 0.1% |
| 9 | 130 | < 0.1% |
| Other values (22) | 260 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1607054 | |
| 1 | 371328 | 17.8% |
| 2 | 69944 | 3.4% |
| 3 | 22850 | 1.1% |
| 4 | 8466 | 0.4% |
| 5 | 3249 | 0.2% |
| 6 | 1361 | 0.1% |
| 7 | 579 | < 0.1% |
| 8 | 255 | < 0.1% |
| 9 | 130 | < 0.1% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 32 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 27 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 3 | |
| 23 | 1 | < 0.1% |
| 22 | 3 |
NUMBER OF PERSONS KILLED
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 31 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.0015003862 |
| Minimum | 0 |
|---|---|
| Maximum | 8 |
| Zeros | 2082457 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 8 |
| Range | 8 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.040839716 |
|---|---|
| Coefficient of variation (CV) | 27.219468 |
| Kurtosis | 1922.4521 |
| Mean | 0.0015003862 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 33.616616 |
| Sum | 3129 |
| Variance | 0.0016678824 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2082457 | |
| 1 | 2913 | 0.1% |
| 2 | 75 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 3 | < 0.1% |
| 5 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| (Missing) | 31 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2082457 | |
| 1 | 2913 | 0.1% |
| 2 | 75 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 3 | < 0.1% |
| 5 | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 8 | 1 | < 0.1% |
| 5 | 2 | < 0.1% |
| 4 | 3 | < 0.1% |
| 3 | 12 | < 0.1% |
| 2 | 75 | < 0.1% |
| 1 | 2913 | 0.1% |
| 0 | 2082457 |
NUMBER OF PEDESTRIANS INJURED
Real number (ℝ)
ZEROS 
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.056723731 |
| Minimum | 0 |
|---|---|
| Maximum | 27 |
| Zeros | 1972076 |
| Zeros (%) | 94.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 27 |
| Range | 27 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2443857 |
|---|---|
| Coefficient of variation (CV) | 4.3083502 |
| Kurtosis | 127.90885 |
| Mean | 0.056723731 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.6654502 |
| Sum | 118297 |
| Variance | 0.059724369 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1972076 | |
| 1 | 109261 | 5.2% |
| 2 | 3679 | 0.2% |
| 3 | 369 | < 0.1% |
| 4 | 61 | < 0.1% |
| 5 | 25 | < 0.1% |
| 6 | 11 | < 0.1% |
| 7 | 4 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1972076 | |
| 1 | 109261 | 5.2% |
| 2 | 3679 | 0.2% |
| 3 | 369 | < 0.1% |
| 4 | 61 | < 0.1% |
| 5 | 25 | < 0.1% |
| 6 | 11 | < 0.1% |
| 7 | 4 | < 0.1% |
| 8 | 2 | < 0.1% |
| 9 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 27 | 1 | < 0.1% |
| 19 | 1 | < 0.1% |
| 15 | 1 | < 0.1% |
| 13 | 1 | < 0.1% |
| 9 | 2 | < 0.1% |
| 8 | 2 | < 0.1% |
| 7 | 4 | < 0.1% |
| 6 | 11 | < 0.1% |
| 5 | 25 | |
| 4 | 61 |
NUMBER OF PEDESTRIANS KILLED
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 115.4 MiB |
| 0 | |
|---|---|
| 1 | 1520 |
| 2 | 12 |
| 6 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2085494 |
|---|---|
| Distinct characters | 4 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2083961 | |
| 1 | 1520 | 0.1% |
| 2 | 12 | < 0.1% |
| 6 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2083961 | |
| 1 | 1520 | 0.1% |
| 2 | 12 | < 0.1% |
| 6 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2083961 | |
| 1 | 1520 | 0.1% |
| 2 | 12 | < 0.1% |
| 6 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2085494 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2083961 | |
| 1 | 1520 | 0.1% |
| 2 | 12 | < 0.1% |
| 6 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2085494 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2083961 | |
| 1 | 1520 | 0.1% |
| 2 | 12 | < 0.1% |
| 6 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2085494 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2083961 | |
| 1 | 1520 | 0.1% |
| 2 | 12 | < 0.1% |
| 6 | 1 | < 0.1% |
NUMBER OF CYCLIST INJURED
Categorical
IMBALANCE 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 115.4 MiB |
| 0 | |
|---|---|
| 1 | 54849 |
| 2 | 609 |
| 3 | 23 |
| 4 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2085494 |
|---|---|
| Distinct characters | 5 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2030012 | |
| 1 | 54849 | 2.6% |
| 2 | 609 | < 0.1% |
| 3 | 23 | < 0.1% |
| 4 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2030012 | |
| 1 | 54849 | 2.6% |
| 2 | 609 | < 0.1% |
| 3 | 23 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2030012 | |
| 1 | 54849 | 2.6% |
| 2 | 609 | < 0.1% |
| 3 | 23 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2085494 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2030012 | |
| 1 | 54849 | 2.6% |
| 2 | 609 | < 0.1% |
| 3 | 23 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2085494 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2030012 | |
| 1 | 54849 | 2.6% |
| 2 | 609 | < 0.1% |
| 3 | 23 | < 0.1% |
| 4 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2085494 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2030012 | |
| 1 | 54849 | 2.6% |
| 2 | 609 | < 0.1% |
| 3 | 23 | < 0.1% |
| 4 | 1 | < 0.1% |
NUMBER OF CYCLIST KILLED
Categorical
IMBALANCE 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 115.4 MiB |
| 0 | |
|---|---|
| 1 | 239 |
| 2 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 2085494 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2085254 | |
| 1 | 239 | < 0.1% |
| 2 | 1 | < 0.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2085254 | |
| 1 | 239 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2085254 | |
| 1 | 239 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2085494 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2085254 | |
| 1 | 239 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2085494 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2085254 | |
| 1 | 239 | < 0.1% |
| 2 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2085494 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 0 | 2085254 | |
| 1 | 239 | < 0.1% |
| 2 | 1 | < 0.1% |
NUMBER OF MOTORIST INJURED
Real number (ℝ)
ZEROS 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.22369904 |
| Minimum | 0 |
|---|---|
| Maximum | 43 |
| Zeros | 1780333 |
| Zeros (%) | 85.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 43 |
| Range | 43 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.66221489 |
|---|---|
| Coefficient of variation (CV) | 2.9602939 |
| Kurtosis | 63.304937 |
| Mean | 0.22369904 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.1139917 |
| Sum | 466523 |
| Variance | 0.43852857 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1780333 | |
| 1 | 205209 | 9.8% |
| 2 | 63820 | 3.1% |
| 3 | 22153 | 1.1% |
| 4 | 8292 | 0.4% |
| 5 | 3198 | 0.2% |
| 6 | 1315 | 0.1% |
| 7 | 553 | < 0.1% |
| 8 | 247 | < 0.1% |
| 9 | 125 | < 0.1% |
| Other values (21) | 249 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 1780333 | |
| 1 | 205209 | 9.8% |
| 2 | 63820 | 3.1% |
| 3 | 22153 | 1.1% |
| 4 | 8292 | 0.4% |
| 5 | 3198 | 0.2% |
| 6 | 1315 | 0.1% |
| 7 | 553 | < 0.1% |
| 8 | 247 | < 0.1% |
| 9 | 125 | < 0.1% |
| Value | Count | Frequency (%) |
| 43 | 1 | < 0.1% |
| 40 | 1 | < 0.1% |
| 34 | 1 | < 0.1% |
| 31 | 1 | < 0.1% |
| 30 | 1 | < 0.1% |
| 25 | 1 | < 0.1% |
| 24 | 3 | |
| 23 | 1 | < 0.1% |
| 22 | 2 | |
| 21 | 1 | < 0.1% |
NUMBER OF MOTORIST KILLED
Real number (ℝ)
SKEWED  ZEROS 
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.00061807898 |
| Minimum | 0 |
|---|---|
| Maximum | 5 |
| Zeros | 2084302 |
| Zeros (%) | 99.9% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.027193584 |
|---|---|
| Coefficient of variation (CV) | 43.99694 |
| Kurtosis | 4196.5461 |
| Mean | 0.00061807898 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 54.56007 |
| Sum | 1289 |
| Variance | 0.000739491 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 2084302 | |
| 1 | 1117 | 0.1% |
| 2 | 59 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 2084302 | |
| 1 | 1117 | 0.1% |
| 2 | 59 | < 0.1% |
| 3 | 12 | < 0.1% |
| 4 | 2 | < 0.1% |
| 5 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 2 | < 0.1% |
| 4 | 2 | < 0.1% |
| 3 | 12 | < 0.1% |
| 2 | 59 | < 0.1% |
| 1 | 1117 | 0.1% |
| 0 | 2084302 |
| Distinct | 61 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6865 |
| Missing (%) | 0.3% |
| Memory size | 151.9 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 43 |
| Mean length | 19.513255 |
| Min length | 1 |
Characters and Unicode
| Total characters | 40560818 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Aggressive Driving/Road Rage |
|---|---|
| 2nd row | Pavement Slippery |
| 3rd row | Following Too Closely |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 709158 | |
| driver | 450464 | 10.9% |
| inattention/distraction | 417718 | 10.1% |
| too | 163564 | 3.9% |
| closely | 163564 | 3.9% |
| to | 148902 | 3.6% |
| failure | 130215 | 3.1% |
| yield | 123992 | 3.0% |
| right-of-way | 123992 | 3.0% |
| following | 111587 | 2.7% |
| Other values (96) | 1600235 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 4563606 | 11.3% |
| e | 4131184 | 10.2% |
| n | 3525907 | 8.7% |
| t | 2814191 | 6.9% |
| o | 2393126 | 5.9% |
| r | 2382367 | 5.9% |
| s | 2107618 | 5.2% |
| 2064762 | 5.1% | |
| a | 2001035 | 4.9% |
| c | 1562117 | 3.9% |
| Other values (45) | 13014905 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 40560818 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 4563606 | 11.3% |
| e | 4131184 | 10.2% |
| n | 3525907 | 8.7% |
| t | 2814191 | 6.9% |
| o | 2393126 | 5.9% |
| r | 2382367 | 5.9% |
| s | 2107618 | 5.2% |
| 2064762 | 5.1% | |
| a | 2001035 | 4.9% |
| c | 1562117 | 3.9% |
| Other values (45) | 13014905 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 40560818 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 4563606 | 11.3% |
| e | 4131184 | 10.2% |
| n | 3525907 | 8.7% |
| t | 2814191 | 6.9% |
| o | 2393126 | 5.9% |
| r | 2382367 | 5.9% |
| s | 2107618 | 5.2% |
| 2064762 | 5.1% | |
| a | 2001035 | 4.9% |
| c | 1562117 | 3.9% |
| Other values (45) | 13014905 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 40560818 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 4563606 | 11.3% |
| e | 4131184 | 10.2% |
| n | 3525907 | 8.7% |
| t | 2814191 | 6.9% |
| o | 2393126 | 5.9% |
| r | 2382367 | 5.9% |
| s | 2107618 | 5.2% |
| 2064762 | 5.1% | |
| a | 2001035 | 4.9% |
| c | 1562117 | 3.9% |
| Other values (45) | 13014905 |
CONTRIBUTING FACTOR VEHICLE 2
Text
MISSING 
| Distinct | 61 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 324030 |
| Missing (%) | 15.5% |
| Memory size | 127.6 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 13.049252 |
| Min length | 1 |
Characters and Unicode
| Total characters | 22985787 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 1483034 | |
| driver | 101442 | 4.7% |
| inattention/distraction | 94701 | 4.4% |
| other | 33254 | 1.5% |
| vehicular | 32189 | 1.5% |
| too | 27894 | 1.3% |
| closely | 27894 | 1.3% |
| passing | 21660 | 1.0% |
| to | 21608 | 1.0% |
| lane | 20197 | 0.9% |
| Other values (96) | 296990 | 13.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 3623508 | |
| e | 3526950 | |
| n | 2060162 | |
| s | 1765767 | |
| c | 1673714 | |
| d | 1556899 | |
| p | 1553108 | |
| f | 1539442 | |
| U | 1519713 | |
| t | 622043 | 2.7% |
| Other values (45) | 3544481 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 22985787 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| i | 3623508 | |
| e | 3526950 | |
| n | 2060162 | |
| s | 1765767 | |
| c | 1673714 | |
| d | 1556899 | |
| p | 1553108 | |
| f | 1539442 | |
| U | 1519713 | |
| t | 622043 | 2.7% |
| Other values (45) | 3544481 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 22985787 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| i | 3623508 | |
| e | 3526950 | |
| n | 2060162 | |
| s | 1765767 | |
| c | 1673714 | |
| d | 1556899 | |
| p | 1553108 | |
| f | 1539442 | |
| U | 1519713 | |
| t | 622043 | 2.7% |
| Other values (45) | 3544481 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 22985787 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| i | 3623508 | |
| e | 3526950 | |
| n | 2060162 | |
| s | 1765767 | |
| c | 1673714 | |
| d | 1556899 | |
| p | 1553108 | |
| f | 1539442 | |
| U | 1519713 | |
| t | 622043 | 2.7% |
| Other values (45) | 3544481 |
CONTRIBUTING FACTOR VEHICLE 3
Text
MISSING 
| Distinct | 51 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1936332 |
| Missing (%) | 92.8% |
| Memory size | 68.9 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 11 |
| Mean length | 11.657151 |
| Min length | 1 |
Characters and Unicode
| Total characters | 1738804 |
|---|---|
| Distinct characters | 55 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
| Value | Count | Frequency (%) |
| unspecified | 139044 | |
| other | 2839 | 1.8% |
| vehicular | 2799 | 1.7% |
| driver | 2150 | 1.3% |
| too | 2024 | 1.2% |
| closely | 2024 | 1.2% |
| following | 1970 | 1.2% |
| inattention/distraction | 1967 | 1.2% |
| fatigued/drowsy | 853 | 0.5% |
| pavement | 414 | 0.3% |
| Other values (79) | 5951 | 3.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 297136 | |
| i | 295801 | |
| n | 152490 | |
| s | 146038 | |
| c | 145473 | |
| d | 141153 | |
| p | 140719 | |
| f | 139955 | |
| U | 139713 | |
| o | 17376 | 1.0% |
| Other values (45) | 122950 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 1738804 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 297136 | |
| i | 295801 | |
| n | 152490 | |
| s | 146038 | |
| c | 145473 | |
| d | 141153 | |
| p | 140719 | |
| f | 139955 | |
| U | 139713 | |
| o | 17376 | 1.0% |
| Other values (45) | 122950 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 1738804 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 297136 | |
| i | 295801 | |
| n | 152490 | |
| s | 146038 | |
| c | 145473 | |
| d | 141153 | |
| p | 140719 | |
| f | 139955 | |
| U | 139713 | |
| o | 17376 | 1.0% |
| Other values (45) | 122950 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 1738804 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 297136 | |
| i | 295801 | |
| n | 152490 | |
| s | 146038 | |
| c | 145473 | |
| d | 141153 | |
| p | 140719 | |
| f | 139955 | |
| U | 139713 | |
| o | 17376 | 1.0% |
| Other values (45) | 122950 |
CONTRIBUTING FACTOR VEHICLE 4
Categorical
IMBALANCE  MISSING 
| Distinct | 41 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 2051785 |
| Missing (%) | 98.4% |
| Memory size | 127.4 MiB |
| Unspecified | |
|---|---|
| Other Vehicular | 623 |
| Following Too Closely | 392 |
| Driver Inattention/Distraction | 278 |
| Fatigued/Drowsy | 170 |
| Other values (36) | 453 |
Length
| Max length | 43 |
|---|---|
| Median length | 11 |
| Mean length | 11.490818 |
| Min length | 5 |
Characters and Unicode
| Total characters | 387344 |
|---|---|
| Distinct characters | 51 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 7 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
Common Values
| Value | Count | Frequency (%) |
| Unspecified | 31793 | 1.5% |
| Other Vehicular | 623 | < 0.1% |
| Following Too Closely | 392 | < 0.1% |
| Driver Inattention/Distraction | 278 | < 0.1% |
| Fatigued/Drowsy | 170 | < 0.1% |
| Pavement Slippery | 119 | < 0.1% |
| Reaction to Uninvolved Vehicle | 42 | < 0.1% |
| Unsafe Speed | 32 | < 0.1% |
| Outside Car Distraction | 29 | < 0.1% |
| Driver Inexperience | 27 | < 0.1% |
| Other values (31) | 204 | < 0.1% |
| (Missing) | 2051785 |
Length
| Value | Count | Frequency (%) |
| unspecified | 31793 | |
| other | 632 | 1.8% |
| vehicular | 623 | 1.7% |
| too | 397 | 1.1% |
| closely | 397 | 1.1% |
| following | 392 | 1.1% |
| driver | 305 | 0.8% |
| inattention/distraction | 278 | 0.8% |
| fatigued/drowsy | 170 | 0.5% |
| pavement | 122 | 0.3% |
| Other values (64) | 975 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 67193 | |
| i | 66571 | |
| n | 33888 | |
| c | 32970 | |
| s | 32946 | |
| p | 32161 | |
| d | 32149 | |
| f | 31921 | |
| U | 31901 | |
| o | 3097 | 0.8% |
| Other values (41) | 22547 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 387344 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 67193 | |
| i | 66571 | |
| n | 33888 | |
| c | 32970 | |
| s | 32946 | |
| p | 32161 | |
| d | 32149 | |
| f | 31921 | |
| U | 31901 | |
| o | 3097 | 0.8% |
| Other values (41) | 22547 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 387344 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 67193 | |
| i | 66571 | |
| n | 33888 | |
| c | 32970 | |
| s | 32946 | |
| p | 32161 | |
| d | 32149 | |
| f | 31921 | |
| U | 31901 | |
| o | 3097 | 0.8% |
| Other values (41) | 22547 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 387344 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 67193 | |
| i | 66571 | |
| n | 33888 | |
| c | 32970 | |
| s | 32946 | |
| p | 32161 | |
| d | 32149 | |
| f | 31921 | |
| U | 31901 | |
| o | 3097 | 0.8% |
| Other values (41) | 22547 | 5.8% |
CONTRIBUTING FACTOR VEHICLE 5
Categorical
IMBALANCE  MISSING 
| Distinct | 30 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2076357 |
| Missing (%) | 99.6% |
| Memory size | 127.3 MiB |
| Unspecified | |
|---|---|
| Other Vehicular | 181 |
| Following Too Closely | 99 |
| Driver Inattention/Distraction | 65 |
| Pavement Slippery | 50 |
| Other values (25) | 131 |
Length
| Max length | 43 |
|---|---|
| Median length | 11 |
| Mean length | 11.469738 |
| Min length | 5 |
Characters and Unicode
| Total characters | 104799 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Unspecified |
|---|---|
| 2nd row | Unspecified |
| 3rd row | Unspecified |
| 4th row | Unspecified |
| 5th row | Unspecified |
Common Values
| Value | Count | Frequency (%) |
| Unspecified | 8611 | 0.4% |
| Other Vehicular | 181 | < 0.1% |
| Following Too Closely | 99 | < 0.1% |
| Driver Inattention/Distraction | 65 | < 0.1% |
| Pavement Slippery | 50 | < 0.1% |
| Fatigued/Drowsy | 41 | < 0.1% |
| Reaction to Uninvolved Vehicle | 12 | < 0.1% |
| Alcohol Involvement | 11 | < 0.1% |
| Obstruction/Debris | 10 | < 0.1% |
| Driver Inexperience | 10 | < 0.1% |
| Other values (20) | 47 | < 0.1% |
| (Missing) | 2076357 |
Length
| Value | Count | Frequency (%) |
| unspecified | 8611 | |
| other | 183 | 1.9% |
| vehicular | 181 | 1.9% |
| too | 101 | 1.0% |
| closely | 101 | 1.0% |
| following | 99 | 1.0% |
| driver | 75 | 0.8% |
| inattention/distraction | 65 | 0.7% |
| pavement | 51 | 0.5% |
| slippery | 50 | 0.5% |
| Other values (47) | 251 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 18245 | |
| i | 18001 | |
| n | 9144 | |
| c | 8935 | |
| s | 8884 | |
| p | 8739 | |
| d | 8696 | |
| f | 8638 | |
| U | 8634 | |
| o | 788 | 0.8% |
| Other values (40) | 6095 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 104799 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| e | 18245 | |
| i | 18001 | |
| n | 9144 | |
| c | 8935 | |
| s | 8884 | |
| p | 8739 | |
| d | 8696 | |
| f | 8638 | |
| U | 8634 | |
| o | 788 | 0.8% |
| Other values (40) | 6095 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 104799 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| e | 18245 | |
| i | 18001 | |
| n | 9144 | |
| c | 8935 | |
| s | 8884 | |
| p | 8739 | |
| d | 8696 | |
| f | 8638 | |
| U | 8634 | |
| o | 788 | 0.8% |
| Other values (40) | 6095 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 104799 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| e | 18245 | |
| i | 18001 | |
| n | 9144 | |
| c | 8935 | |
| s | 8884 | |
| p | 8739 | |
| d | 8696 | |
| f | 8638 | |
| U | 8634 | |
| o | 788 | 0.8% |
| Other values (40) | 6095 | 5.8% |
COLLISION_ID
Real number (ℝ)
UNIQUE 
| Distinct | 2085494 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3167144.8 |
| Minimum | 22 |
|---|---|
| Maximum | 4722272 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 15.9 MiB |
Quantile statistics
| Minimum | 22 |
|---|---|
| 5-th percentile | 105128.65 |
| Q1 | 3157493.2 |
| median | 3678992.5 |
| Q3 | 4200608.8 |
| 95-th percentile | 4617783.3 |
| Maximum | 4722272 |
| Range | 4722250 |
| Interquartile range (IQR) | 1043115.5 |
Descriptive statistics
| Standard deviation | 1505387.7 |
|---|---|
| Coefficient of variation (CV) | 0.47531383 |
| Kurtosis | -0.019103384 |
| Mean | 3167144.8 |
| Median Absolute Deviation (MAD) | 521558 |
| Skewness | -1.226785 |
| Sum | 6.6050615 × 1012 |
| Variance | 2.2661922 × 1012 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4455765 | 1 | < 0.1% |
| 3174628 | 1 | < 0.1% |
| 3172280 | 1 | < 0.1% |
| 3160927 | 1 | < 0.1% |
| 3173224 | 1 | < 0.1% |
| 3171866 | 1 | < 0.1% |
| 3172720 | 1 | < 0.1% |
| 3162782 | 1 | < 0.1% |
| 3168818 | 1 | < 0.1% |
| 3159257 | 1 | < 0.1% |
| Other values (2085484) | 2085484 |
| Value | Count | Frequency (%) |
| 22 | 1 | |
| 23 | 1 | |
| 24 | 1 | |
| 25 | 1 | |
| 26 | 1 | |
| 27 | 1 | |
| 28 | 1 | |
| 29 | 1 | |
| 30 | 1 | |
| 31 | 1 |
| Value | Count | Frequency (%) |
| 4722272 | 1 | |
| 4722270 | 1 | |
| 4722268 | 1 | |
| 4722265 | 1 | |
| 4722264 | 1 | |
| 4722263 | 1 | |
| 4722260 | 1 | |
| 4722259 | 1 | |
| 4722254 | 1 | |
| 4722253 | 1 |
| Distinct | 1647 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 13866 |
| Missing (%) | 0.7% |
| Memory size | 146.4 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 35 |
| Mean length | 16.882971 |
| Min length | 1 |
Characters and Unicode
| Total characters | 34975236 |
|---|---|
| Distinct characters | 75 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1001 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Sedan |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Dump |
| Value | Count | Frequency (%) |
| vehicle | 883823 | |
| utility | 637368 | |
| station | 637325 | |
| sedan | 623996 | |
| wagon/sport | 457034 | |
| passenger | 416219 | |
| 181675 | 3.7% | |
| wagon | 180355 | 3.7% |
| sport | 180291 | 3.7% |
| truck | 86442 | 1.8% |
| Other values (957) | 618147 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2844267 | 8.1% | |
| S | 2747229 | 7.9% |
| t | 2318875 | 6.6% |
| i | 1953175 | 5.6% |
| E | 1819085 | 5.2% |
| a | 1632608 | 4.7% |
| e | 1623631 | 4.6% |
| n | 1560172 | 4.5% |
| o | 1447330 | 4.1% |
| T | 1142647 | 3.3% |
| Other values (65) | 15886217 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 34975236 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2844267 | 8.1% | |
| S | 2747229 | 7.9% |
| t | 2318875 | 6.6% |
| i | 1953175 | 5.6% |
| E | 1819085 | 5.2% |
| a | 1632608 | 4.7% |
| e | 1623631 | 4.6% |
| n | 1560172 | 4.5% |
| o | 1447330 | 4.1% |
| T | 1142647 | 3.3% |
| Other values (65) | 15886217 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 34975236 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2844267 | 8.1% | |
| S | 2747229 | 7.9% |
| t | 2318875 | 6.6% |
| i | 1953175 | 5.6% |
| E | 1819085 | 5.2% |
| a | 1632608 | 4.7% |
| e | 1623631 | 4.6% |
| n | 1560172 | 4.5% |
| o | 1447330 | 4.1% |
| T | 1142647 | 3.3% |
| Other values (65) | 15886217 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 34975236 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2844267 | 8.1% | |
| S | 2747229 | 7.9% |
| t | 2318875 | 6.6% |
| i | 1953175 | 5.6% |
| E | 1819085 | 5.2% |
| a | 1632608 | 4.7% |
| e | 1623631 | 4.6% |
| n | 1560172 | 4.5% |
| o | 1447330 | 4.1% |
| T | 1142647 | 3.3% |
| Other values (65) | 15886217 |
MISSING 
| Distinct | 1834 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 399976 |
| Missing (%) | 19.2% |
| Memory size | 129.7 MiB |
Length
| Max length | 38 |
|---|---|
| Median length | 30 |
| Mean length | 16.079394 |
| Min length | 1 |
Characters and Unicode
| Total characters | 27102108 |
|---|---|
| Distinct characters | 73 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 1085 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Pick-up Truck |
| 3rd row | Sedan |
| 4th row | Tractor Truck Diesel |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
| vehicle | 655799 | |
| utility | 468831 | |
| station | 468803 | |
| sedan | 438283 | |
| wagon/sport | 328599 | |
| passenger | 318612 | |
| 141517 | 3.7% | |
| wagon | 140257 | 3.6% |
| sport | 140204 | 3.6% |
| truck | 85798 | 2.2% |
| Other values (1011) | 658039 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2172192 | 8.0% | |
| S | 2038120 | 7.5% |
| t | 1676594 | 6.2% |
| i | 1440794 | 5.3% |
| E | 1438910 | 5.3% |
| e | 1198012 | 4.4% |
| a | 1173113 | 4.3% |
| n | 1114429 | 4.1% |
| o | 1067178 | 3.9% |
| T | 920168 | 3.4% |
| Other values (63) | 12862598 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 27102108 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 2172192 | 8.0% | |
| S | 2038120 | 7.5% |
| t | 1676594 | 6.2% |
| i | 1440794 | 5.3% |
| E | 1438910 | 5.3% |
| e | 1198012 | 4.4% |
| a | 1173113 | 4.3% |
| n | 1114429 | 4.1% |
| o | 1067178 | 3.9% |
| T | 920168 | 3.4% |
| Other values (63) | 12862598 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 27102108 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 2172192 | 8.0% | |
| S | 2038120 | 7.5% |
| t | 1676594 | 6.2% |
| i | 1440794 | 5.3% |
| E | 1438910 | 5.3% |
| e | 1198012 | 4.4% |
| a | 1173113 | 4.3% |
| n | 1114429 | 4.1% |
| o | 1067178 | 3.9% |
| T | 920168 | 3.4% |
| Other values (63) | 12862598 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 27102108 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 2172192 | 8.0% | |
| S | 2038120 | 7.5% |
| t | 1676594 | 6.2% |
| i | 1440794 | 5.3% |
| E | 1438910 | 5.3% |
| e | 1198012 | 4.4% |
| a | 1173113 | 4.3% |
| n | 1114429 | 4.1% |
| o | 1067178 | 3.9% |
| T | 920168 | 3.4% |
| Other values (63) | 12862598 |
MISSING 
| Distinct | 263 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 1941769 |
| Missing (%) | 93.1% |
| Memory size | 69.5 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 17.68192 |
| Min length | 2 |
Characters and Unicode
| Total characters | 2541334 |
|---|---|
| Distinct characters | 62 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 154 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Sedan |
|---|---|
| 2nd row | Station Wagon/Sport Utility Vehicle |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
| vehicle | 64598 | |
| utility | 49809 | |
| station | 49807 | |
| sedan | 47549 | |
| wagon/sport | 36448 | |
| passenger | 27716 | |
| 13440 | 3.9% | |
| wagon | 13359 | 3.8% |
| sport | 13358 | 3.8% |
| truck | 4365 | 1.3% |
| Other values (219) | 28575 |
Most occurring characters
| Value | Count | Frequency (%) |
| 205734 | 8.1% | |
| S | 201673 | 7.9% |
| t | 183663 | 7.2% |
| i | 151728 | 6.0% |
| a | 124057 | 4.9% |
| e | 123604 | 4.9% |
| n | 121337 | 4.8% |
| E | 116407 | 4.6% |
| o | 112351 | 4.4% |
| T | 77081 | 3.0% |
| Other values (52) | 1123699 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 2541334 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 205734 | 8.1% | |
| S | 201673 | 7.9% |
| t | 183663 | 7.2% |
| i | 151728 | 6.0% |
| a | 124057 | 4.9% |
| e | 123604 | 4.9% |
| n | 121337 | 4.8% |
| E | 116407 | 4.6% |
| o | 112351 | 4.4% |
| T | 77081 | 3.0% |
| Other values (52) | 1123699 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 2541334 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 205734 | 8.1% | |
| S | 201673 | 7.9% |
| t | 183663 | 7.2% |
| i | 151728 | 6.0% |
| a | 124057 | 4.9% |
| e | 123604 | 4.9% |
| n | 121337 | 4.8% |
| E | 116407 | 4.6% |
| o | 112351 | 4.4% |
| T | 77081 | 3.0% |
| Other values (52) | 1123699 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 2541334 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 205734 | 8.1% | |
| S | 201673 | 7.9% |
| t | 183663 | 7.2% |
| i | 151728 | 6.0% |
| a | 124057 | 4.9% |
| e | 123604 | 4.9% |
| n | 121337 | 4.8% |
| E | 116407 | 4.6% |
| o | 112351 | 4.4% |
| T | 77081 | 3.0% |
| Other values (52) | 1123699 |
MISSING 
| Distinct | 103 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 2052958 |
| Missing (%) | 98.4% |
| Memory size | 65.0 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 17.978455 |
| Min length | 2 |
Characters and Unicode
| Total characters | 584947 |
|---|---|
| Distinct characters | 57 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 47 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Station Wagon/Sport Utility Vehicle |
|---|---|
| 2nd row | Sedan |
| 3rd row | Station Wagon/Sport Utility Vehicle |
| 4th row | Sedan |
| 5th row | Sedan |
| Value | Count | Frequency (%) |
| vehicle | 14990 | |
| utility | 11816 | |
| station | 11816 | |
| sedan | 11508 | |
| wagon/sport | 8964 | |
| passenger | 5970 | 7.5% |
| 2859 | 3.6% | |
| sport | 2852 | 3.6% |
| wagon | 2852 | 3.6% |
| truck | 804 | 1.0% |
| Other values (104) | 5065 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| 47016 | 8.0% | |
| S | 46713 | 8.0% |
| t | 45036 | 7.7% |
| i | 36966 | 6.3% |
| a | 30102 | 5.1% |
| e | 29891 | 5.1% |
| n | 29582 | 5.1% |
| o | 27364 | 4.7% |
| E | 24670 | 4.2% |
| l | 18162 | 3.1% |
| Other values (47) | 249445 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 584947 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 47016 | 8.0% | |
| S | 46713 | 8.0% |
| t | 45036 | 7.7% |
| i | 36966 | 6.3% |
| a | 30102 | 5.1% |
| e | 29891 | 5.1% |
| n | 29582 | 5.1% |
| o | 27364 | 4.7% |
| E | 24670 | 4.2% |
| l | 18162 | 3.1% |
| Other values (47) | 249445 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 584947 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 47016 | 8.0% | |
| S | 46713 | 8.0% |
| t | 45036 | 7.7% |
| i | 36966 | 6.3% |
| a | 30102 | 5.1% |
| e | 29891 | 5.1% |
| n | 29582 | 5.1% |
| o | 27364 | 4.7% |
| E | 24670 | 4.2% |
| l | 18162 | 3.1% |
| Other values (47) | 249445 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 584947 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 47016 | 8.0% | |
| S | 46713 | 8.0% |
| t | 45036 | 7.7% |
| i | 36966 | 6.3% |
| a | 30102 | 5.1% |
| e | 29891 | 5.1% |
| n | 29582 | 5.1% |
| o | 27364 | 4.7% |
| E | 24670 | 4.2% |
| l | 18162 | 3.1% |
| Other values (47) | 249445 |
MISSING 
| Distinct | 71 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 2076637 |
| Missing (%) | 99.6% |
| Memory size | 64.0 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 30 |
| Mean length | 18.21452 |
| Min length | 2 |
Characters and Unicode
| Total characters | 161326 |
|---|---|
| Distinct characters | 54 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 32 ? |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Station Wagon/Sport Utility Vehicle |
|---|---|
| 2nd row | Station Wagon/Sport Utility Vehicle |
| 3rd row | Sedan |
| 4th row | Sedan |
| 5th row | Station Wagon/Sport Utility Vehicle |
| Value | Count | Frequency (%) |
| vehicle | 4048 | |
| utility | 3354 | |
| station | 3354 | |
| sedan | 3214 | |
| wagon/sport | 2552 | |
| passenger | 1487 | 6.8% |
| 804 | 3.7% | |
| wagon | 804 | 3.7% |
| sport | 802 | 3.7% |
| truck | 248 | 1.1% |
| Other values (69) | 1201 | 5.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| 13021 | 8.1% | |
| t | 12829 | 8.0% |
| S | 12811 | 7.9% |
| i | 10525 | 6.5% |
| a | 8498 | 5.3% |
| e | 8443 | 5.2% |
| n | 8378 | 5.2% |
| o | 7809 | 4.8% |
| E | 6129 | 3.8% |
| l | 5170 | 3.2% |
| Other values (44) | 67713 |
Most occurring categories
| Value | Count | Frequency (%) |
| (unknown) | 161326 |
Most frequent character per category
(unknown)
| Value | Count | Frequency (%) |
| 13021 | 8.1% | |
| t | 12829 | 8.0% |
| S | 12811 | 7.9% |
| i | 10525 | 6.5% |
| a | 8498 | 5.3% |
| e | 8443 | 5.2% |
| n | 8378 | 5.2% |
| o | 7809 | 4.8% |
| E | 6129 | 3.8% |
| l | 5170 | 3.2% |
| Other values (44) | 67713 |
Most occurring scripts
| Value | Count | Frequency (%) |
| (unknown) | 161326 |
Most frequent character per script
(unknown)
| Value | Count | Frequency (%) |
| 13021 | 8.1% | |
| t | 12829 | 8.0% |
| S | 12811 | 7.9% |
| i | 10525 | 6.5% |
| a | 8498 | 5.3% |
| e | 8443 | 5.2% |
| n | 8378 | 5.2% |
| o | 7809 | 4.8% |
| E | 6129 | 3.8% |
| l | 5170 | 3.2% |
| Other values (44) | 67713 |
Most occurring blocks
| Value | Count | Frequency (%) |
| (unknown) | 161326 |
Most frequent character per block
(unknown)
| Value | Count | Frequency (%) |
| 13021 | 8.1% | |
| t | 12829 | 8.0% |
| S | 12811 | 7.9% |
| i | 10525 | 6.5% |
| a | 8498 | 5.3% |
| e | 8443 | 5.2% |
| n | 8378 | 5.2% |
| o | 7809 | 4.8% |
| E | 6129 | 3.8% |
| l | 5170 | 3.2% |
| Other values (44) | 67713 |
| CRASH DATE | CRASH TIME | BOROUGH | ZIP CODE | LATITUDE | LONGITUDE | LOCATION | ON STREET NAME | CROSS STREET NAME | OFF STREET NAME | NUMBER OF PERSONS INJURED | NUMBER OF PERSONS KILLED | NUMBER OF PEDESTRIANS INJURED | NUMBER OF PEDESTRIANS KILLED | NUMBER OF CYCLIST INJURED | NUMBER OF CYCLIST KILLED | NUMBER OF MOTORIST INJURED | NUMBER OF MOTORIST KILLED | CONTRIBUTING FACTOR VEHICLE 1 | CONTRIBUTING FACTOR VEHICLE 2 | CONTRIBUTING FACTOR VEHICLE 3 | CONTRIBUTING FACTOR VEHICLE 4 | CONTRIBUTING FACTOR VEHICLE 5 | COLLISION_ID | VEHICLE TYPE CODE 1 | VEHICLE TYPE CODE 2 | VEHICLE TYPE CODE 3 | VEHICLE TYPE CODE 4 | VEHICLE TYPE CODE 5 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 09/11/2021 | 2:39 | NaN | NaN | NaN | NaN | NaN | WHITESTONE EXPRESSWAY | 20 AVENUE | NaN | 2.0 | 0.0 | 0 | 0 | 0 | 0 | 2 | 0 | Aggressive Driving/Road Rage | Unspecified | NaN | NaN | NaN | 4455765 | Sedan | Sedan | NaN | NaN | NaN |
| 1 | 03/26/2022 | 11:45 | NaN | NaN | NaN | NaN | NaN | QUEENSBORO BRIDGE UPPER | NaN | NaN | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Pavement Slippery | NaN | NaN | NaN | NaN | 4513547 | Sedan | NaN | NaN | NaN | NaN |
| 2 | 06/29/2022 | 6:55 | NaN | NaN | NaN | NaN | NaN | THROGS NECK BRIDGE | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Following Too Closely | Unspecified | NaN | NaN | NaN | 4541903 | Sedan | Pick-up Truck | NaN | NaN | NaN |
| 3 | 09/11/2021 | 9:35 | BROOKLYN | 11208.0 | 40.667202 | -73.866500 | (40.667202, -73.8665) | NaN | NaN | 1211 LORING AVENUE | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | NaN | NaN | NaN | NaN | 4456314 | Sedan | NaN | NaN | NaN | NaN |
| 4 | 12/14/2021 | 8:13 | BROOKLYN | 11233.0 | 40.683304 | -73.917274 | (40.683304, -73.917274) | SARATOGA AVENUE | DECATUR STREET | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | 4486609 | NaN | NaN | NaN | NaN | NaN |
| 5 | 04/14/2021 | 12:47 | NaN | NaN | NaN | NaN | NaN | MAJOR DEEGAN EXPRESSWAY RAMP | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Unspecified | Unspecified | NaN | NaN | NaN | 4407458 | Dump | Sedan | NaN | NaN | NaN |
| 6 | 12/14/2021 | 17:05 | NaN | NaN | 40.709183 | -73.956825 | (40.709183, -73.956825) | BROOKLYN QUEENS EXPRESSWAY | NaN | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Passing Too Closely | Unspecified | NaN | NaN | NaN | 4486555 | Sedan | Tractor Truck Diesel | NaN | NaN | NaN |
| 7 | 12/14/2021 | 8:17 | BRONX | 10475.0 | 40.868160 | -73.831480 | (40.86816, -73.83148) | NaN | NaN | 344 BAYCHESTER AVENUE | 2.0 | 0.0 | 0 | 0 | 0 | 0 | 2 | 0 | Unspecified | Unspecified | NaN | NaN | NaN | 4486660 | Sedan | Sedan | NaN | NaN | NaN |
| 8 | 12/14/2021 | 21:10 | BROOKLYN | 11207.0 | 40.671720 | -73.897100 | (40.67172, -73.8971) | NaN | NaN | 2047 PITKIN AVENUE | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inexperience | Unspecified | NaN | NaN | NaN | 4487074 | Sedan | NaN | NaN | NaN | NaN |
| 9 | 12/14/2021 | 14:58 | MANHATTAN | 10017.0 | 40.751440 | -73.973970 | (40.75144, -73.97397) | 3 AVENUE | EAST 43 STREET | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Passing Too Closely | Unspecified | NaN | NaN | NaN | 4486519 | Sedan | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |
| CRASH DATE | CRASH TIME | BOROUGH | ZIP CODE | LATITUDE | LONGITUDE | LOCATION | ON STREET NAME | CROSS STREET NAME | OFF STREET NAME | NUMBER OF PERSONS INJURED | NUMBER OF PERSONS KILLED | NUMBER OF PEDESTRIANS INJURED | NUMBER OF PEDESTRIANS KILLED | NUMBER OF CYCLIST INJURED | NUMBER OF CYCLIST KILLED | NUMBER OF MOTORIST INJURED | NUMBER OF MOTORIST KILLED | CONTRIBUTING FACTOR VEHICLE 1 | CONTRIBUTING FACTOR VEHICLE 2 | CONTRIBUTING FACTOR VEHICLE 3 | CONTRIBUTING FACTOR VEHICLE 4 | CONTRIBUTING FACTOR VEHICLE 5 | COLLISION_ID | VEHICLE TYPE CODE 1 | VEHICLE TYPE CODE 2 | VEHICLE TYPE CODE 3 | VEHICLE TYPE CODE 4 | VEHICLE TYPE CODE 5 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2085484 | 03/05/2024 | 20:40 | QUEENS | 11375.0 | 40.722622 | -73.849144 | (40.722622, -73.849144) | YELLOWSTONE BOULEVARD | GERARD PLACE | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Driver Inattention/Distraction | Unspecified | NaN | NaN | NaN | 4707384 | Sedan | Tractor Truck Diesel | NaN | NaN | NaN |
| 2085485 | 03/05/2024 | 7:30 | NaN | NaN | 40.772953 | -73.920280 | (40.772953, -73.92028) | 26 STREET | HOYT AVENUE NORTH | NaN | 0.0 | 0.0 | 0 | 0 | 0 | 0 | 0 | 0 | Turning Improperly | Driver Inattention/Distraction | NaN | NaN | NaN | 4707737 | Box Truck | Garbage or Refuse | NaN | NaN | NaN |
| 2085486 | 03/05/2024 | 14:50 | NaN | NaN | 40.646000 | -73.971750 | (40.646, -73.97175) | CHURCH AVENUE | EAST 8 STREET | NaN | 2.0 | 0.0 | 2 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN | NaN | NaN | 4707432 | NaN | NaN | NaN | NaN | NaN |
| 2085487 | 03/05/2024 | 14:00 | NaN | NaN | 40.722250 | -74.005920 | (40.72225, -74.00592) | CANAL STREET | AVENUE OF THE AMERICAS | NaN | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Following Too Closely | Following Too Closely | NaN | NaN | NaN | 4707476 | Sedan | NaN | NaN | NaN | NaN |
| 2085488 | 02/06/2024 | 12:37 | BROOKLYN | 11235.0 | 40.586670 | -73.966156 | (40.58667, -73.966156) | OCEAN PARKWAY | AVENUE Z | NaN | 1.0 | 0.0 | 1 | 0 | 0 | 0 | 0 | 0 | Unspecified | NaN | NaN | NaN | NaN | 4707884 | E-Bike | NaN | NaN | NaN | NaN |
| 2085489 | 03/05/2024 | 17:22 | QUEENS | 11436.0 | 40.680477 | -73.792100 | (40.680477, -73.7921) | SUTPHIN BOULEVARD | FOCH BOULEVARD | NaN | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Failure to Yield Right-of-Way | Unspecified | NaN | NaN | NaN | 4707511 | Station Wagon/Sport Utility Vehicle | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |
| 2085490 | 03/05/2024 | 17:00 | BROOKLYN | 11204.0 | 40.610786 | -73.978820 | (40.610786, -73.97882) | NaN | NaN | 161 AVENUE O | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Driver Inexperience | Unspecified | Unspecified | Unspecified | NaN | 4707419 | Ambulance | PK | Van | PK | NaN |
| 2085491 | 03/03/2024 | 17:50 | NaN | NaN | 40.675053 | -73.947235 | (40.675053, -73.947235) | SAINT MARKS AVENUE | NaN | NaN | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Aggressive Driving/Road Rage | Unspecified | NaN | NaN | NaN | 4707855 | Station Wagon/Sport Utility Vehicle | PK | NaN | NaN | NaN |
| 2085492 | 03/05/2024 | 14:30 | BROOKLYN | 11207.0 | 40.677900 | -73.892586 | (40.6779, -73.892586) | MILLER AVENUE | FULTON STREET | NaN | 1.0 | 0.0 | 1 | 0 | 0 | 0 | 0 | 0 | Pedestrian/Bicyclist/Other Pedestrian Error/Confusion | NaN | NaN | NaN | NaN | 4707872 | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN | NaN |
| 2085493 | 03/05/2024 | 8:00 | QUEENS | 11385.0 | 40.706512 | -73.878136 | (40.706512, -73.878136) | EDSALL AVENUE | 73 STREET | NaN | 1.0 | 0.0 | 0 | 0 | 0 | 0 | 1 | 0 | Failure to Yield Right-of-Way | Unspecified | NaN | NaN | NaN | 4707447 | Sedan | Station Wagon/Sport Utility Vehicle | NaN | NaN | NaN |